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Abstract 

This  paper  deals  with  the  problem  of  selecting  the  best  population  among  k  popu¬ 
lations  belonging  to  the  same  class  of  exponential  family  distributions  through  sequential 
subset  selection  approach.  We  desire  that  the  best  population  should  be  selected  and 
each  selected  population  should  be  good.  Based  on  the  modified  likelihood  ratio  of  the 
conditional  frequency  function  of  some  statistics,  an  elimination-type  sequential  subset  se¬ 
lection  procedure  is  proposed.  This  sequential  procedure  achieves  the  selection  goal  with 
guaranteed  probability  at  least  P*  for  some  prespecified  value  P*.  At  each  stage,  this 
procedure  also  provides  some  statistical  inference  about  an  upper  bound  on  the  measure 
of  separation  from  the  u  lknown  best  population  to  each  remaining  contending  popula¬ 
tion.  Finally,  a  modified  sequential  procedure  to  select  a  good  population  is  also  studied. 
This  modified  sequential  procedure  achieves  the  goal  of  selecting  a  good  population  with 
guaranteed  probability  at  least  P*.  'Vp.  _ - 
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On  A  Sequential  Subset  Selection  Procedure 
For  Exponential  Family  Distributions 

1.  Introduction 

Consider  designing  and  analyzing  an  experiment  for  comparing  k  populations 
ttj  , . . . ,  7Tfc .  Suppose  that  observations  can  be  obtained  from  the  k  populations  sequen¬ 
tially.  It  is  often  desirable  to  terminate  sampling  from  a  population  as  soon  as  there  is 
statistical  evidence  that  it  is  not  the  best  population,  and  this  population  is  eliminated 
from  further  consideration.  Selection  through  sequential  comparison  with  elimination  pro¬ 
vides  a  significant  advantage.  To  achieve  a  certain  accuracy,  it  requires,  on  the  average, 
substantially  fewer  samples  than  the  fixed  sample  size  procedures. 

In  sequential  selection  and  ranking  procedures,  contributions  have  been  made  to  select 
the  best  population  by  using  the  indifference  zone  approach.  The  simplest  formulation  of 
the  indifference  zone  approach  is  the  situation  where  one  may  wish  to  select  only  a  single 
population  and  guarantee  with  a  prespecified  probability  that  the  selected  population  is 
the  best  population  provided  some  other  condition  on  the  parameters  is  satisfied,  usually 
an  indifference  zone.  However,  in  the  real  situation,  it  is  hard  or  not  possible  to  specify 
such  a  condition.  Thus,  a  reasonable  and  useful  approach  is  to  derive  some  sequential 
selection  procedure  to  select  a  small  subset  containing  the  best  population.  Further,  it  is 
desired  that  each  of  the  selected  populations  should  be  not  far  from  the  best  population. 
Therefore,  some  statistical  inference  is  needed  to  assert  that  each  selected  population  is 
within  some  prespecified  distance  from  the  best  population. 

In  this  paper,  we  assume  that  the  random  observations  from  population  7r»  has  a 
frequency  function  f{x\9{)  of  the  form 

f{x\Ot)  =  exp{Q(^)P(i)  +  R(x)  -  /3{0l)}.  (l.l) 

Here,  P( x)  and  R(x)  do  not  involve  the  parameter  9 ,  while  Q[6)  and  0(9)  do  not  depend 
on  x.  The  function  Q{6 )  is  assumed  to  be  a  continuous,  strictly  increasing  function  of  9. 

Define  6XJ  =  Q(0t )  -  Q(0j)  as  a  measure  of  separation  from  7r,  to  n j.  This  particular 


separation  measure  has  been  considered  by  Bechhofer,  Kiefer  and  Sobel  (1968).  We  call 
irt-  the  best  population  if  tt,-  is  the  unique  population  such  that  Q{6<)  =  max  Q(0,).  If 

i<j<fc 

more  than  one  population  has  this  property,  one  of  them  is  tagged  and  considered  as  the 
best  population.  Since  the  function  Q{6)  is  increasing  in  0,  the  population  associated 
with  0(fc)  =  ma x^0y  is  the  best  population.  We  let  ( k )  denote  the  index  of  the  best 
population  and  denote  the  best  population  by  ^(k)-  We  note  that  a  sequential  procedure 
for  selecting  the  best  population  from  among  exponential  family  distributions  through 
the  indifference  zone  approach  has  been  derived  by  Hoel  and  Mazumdar  (1968).  Perng 
and  Groves  (1977)  have  also  derived  two  sequential  procedures  for  partitioning  a  set  of 
one-parameter  exponential  populations  with  respect  to  a  control.  Since  it  is  assumed 
that  there  is  no  prior  information  about  the  possible  configurations  of  the  measures  of 
separation  =  l,...,k,t  ^  j,  thus,  subset  selection  approach  is  more  appropriate 

here.  However,  for  the  usual  subset  selection  approach  (for  example,  see  Gupta  (1965)), 
the  size  of  the  selected  subset  is  a  random  number,  and  the  quality  of  each  of  the  selected 
populations  cannot  be  guaranteed.  In  the  fixed  sample  size  case,  Gupta  and  Santner  (1973) 
and  Santner  (1975)  introduced  restricted  subset  selection  procedures  in  which  the  size  of 
the  selected  subset  is  at  most  m,  where  1  <  m  <  fc  —  1.  In  this  paper,  we  try  to  control 
the  quality  of  the  selected  populations. 


For  a  prespecified  positive  constant  6* ,  we  say  that  population  tt,  is  good  if  Q (#(*))  — 
Q(0i)  <  6*  and  bad  otherwise.  Let  S  denote  the  selected  subset  and  CS(S')  denote  the 
event  that  €  S  and  6(k)i  <  for  all  7r,  6  S.  We  desire  a  sequential  subset  selection 
procedure  P  such  that 

P0{CS{6*)\P}  >  P*  forall0en  (1.2) 


where  P*  (k  1  <  P*  <  1)  is  a  preassigned  probability  level  and  fl  =  {0  =  (0j, ... ,  0*)(/(x|0,-) 
is  well  defined,  i  =  1  ,...,£}  is  the  parameter  space. 


For  this  particular  measure  of  separation  6,y,  we  consider  an  appropriate  transforma¬ 
tion  of  the  random  observations  taken  from  any  two  populations.  With  this  transformation, 
the  likelihood  function  of  the  new  statistics  can  be  factored  into  two  pc'  ts,  one  of  which, 
obtained  by  a  conditional  argument,  and  termed  the  conditional  likelihood  function,  is  a 

S  n 


function  only  of  the  parameter  of  interest.  Based  on  this  conditional  likelihood  function, 
a  sequential  subset  selection  procedure  is  derived.  This  sequential  subset  selection  pro¬ 
cedure  achieves  the  selection  goal  described  above.  At  each  stage,  it  also  provides  some 
statistical  inference  about  the  bounds  of  separation  between  each  remaining  population 
and  the  unknown  best  population.  Finally  a  modified  sequential  procedure  to  select  a 
good  population  is  also  studied.  This  modified  sequential  procedure  achieves  the  goal  of 
selecting  a  good  population  with  guaranteed  probability  at  least  P* . 

2.  Some  Properties  Associated  with  Exponential  Family  Distributions 

Let  X{n  denote  the  nth  observation  taken  from  population  fly.  It  is  assumed  that 
the  random  observations  Xtn,n  =  1,2, .. .  are  independently  distributed  with  the  common 
frequency  function  f(x\6i),i  =  1  ,...,£.  For  each  pair  (t,j),l  <  i,j  <  k,i  ^  j,  let 

WiM  =  (P(X,„)  -  P(X. j„))/2,  Z„(n)  =  (P(AT,„)  +  P(X,„))/ 2.  (2.1) 

Then,  (W<y(n),  Z,y(n))  have  a  joint  frequency  function 
g(w,z\0i,6j)  =  cexv{6ijW  +  ( Q(0i )  +  Q(0,))z  +  a(z,w)  -  /?(0.)  -  (3{6j)}IA{w,z),  (2.2) 
and  given  (n)  =  z,  the  conditional  frequency  function  of  W,-y(n)  is 

g{w\z,6xj)  =  exp {6i}-w  +  a(z,w)  -  z,6i})}IA(w,z ),  (2.3) 

where 

r  (a)  a(z,  w)  —  R(z  +  w)  +  R(z  —  w), 

(b)  'F(M)  =  lo8  fweA,  exp{6w  +  a(z,w)}dw, 

<  (c)  The  set  A  is  the  support  of  the  random  vector  (IFty(n),  Z„(n))  (2.4) 

and  Az  =  { w\(w,z )  G  A},  and 

(d)  c  is  a  constant  such  that  g(w,z\6i,6j)  is  a  frequency  function. 

Note  that  for  each  z ,  (w,  z)  6  A  iff  (—tv,  z)  £  A.  Thus,  w  E  Az  if  and  only  if  —  w  E  Az- 
Also,  a(z,  u>)  =  o(z,  —  iy).  Hence,  by  the  definition  of  ^(z,6),  ^(z,  <5)  =  ^(z,  —  6). 


From  (2.3),  it  is  clear  that  g(w  Mi)  has  exponential  family  distribution  anv.  hence 
has  monotone  likelihood  ratio  property.  Therefore,  Eg\h(Wij(n))\Zij(n)}  is  nondecreasing 


in  6  whenever  h(w)  is  any  nondecreasing  function  of  w.  Also,  from  the  usual  theory  for  the 
exponential  family  distributions,  for  given  Zij[n )  =  2,  the  conditional  mean  and  variance 
of  Wij(n),  for  any  So,  are: 


2«.|W<i(»)  |Z„(n)  =  2|  =  &*(*,*) 


=  *«(*,*>), 


s=s0 


Var«0  {Wij  (n)  |  Z,j  (n)  =  z)  = 


=  Vss  {*,  bo)  >  0. 


6—6q 


(2.5) 


The  following  lemma  can  be  obtained  easily  based  on  the  above  discussion. 


Lemma  2.1.  For  each  z  belonging  to  the  domain  of  the  random  variable  Z,y(n), 

(a)  ^(2, 6)  =  W(z,  —  S)  from  all  6. 

(b)  $f(z,S)  is  a  convex  function  of  6. 

(c)  ^(z,5)  is  strictly  increasing  (decreasing)  in  6  for  all  6  E  [0,  oo) 

(5  E  (—oo,0]),  and  ^(2,0)  =  inf  ^(2, 6). 

6 

(d)  ^6(2, 60)  is  strictly  increasing  in  bo.Vt'g^.O)  =  0  and  ^5(2, -60)  =  ~*I>fi(z,6o). 

(e)  Let  ai,a2,bx  and  b2  be  points  such  that  a\  <  a2  <  b2,ay  <  bi  <  b2. 

Then 

’  61  — ai  —  63  —  aj 

(f)  For  each  fixed  2,  define 

I{b\z)  =  bV6{z, 6)  -  ^(2,6)  +  ^(2,0).  (2.6) 

Then,  I{b\z)  >  0  for  all  b  and  /(6| 2)  >  0  if  b  ^  0. 


3.  A  Sequential  Subset  Selection  Procedure  P(H,S*) 

Let  S*  >  0  be  the  prespecified  value  used  to  define  the  event  CS(S*).  Let  H(y)  be  a 
distribution  function  satisfying  Condition  A. 


Condition  A:  H(y)  is  a  distribution  function  defined  on  the  interval  [0,6*]  such  that  for 
some  constant  cq  E  (0,6*),  the  interval  (0,  Co]  is  contained  in  the  support  of  H. 


For  each  a  >  0,n  >  1  and  each  pair  (*,  j),  define 

So  11  9{Wij  (m) \Zij  (m) ,  y)  dH  (y) 

Li,(n,a)  =  - S=1 - ,  (3.1) 

m=  1 

where  </(ui|z,6)  is  the  conditional  frequency  function  defined  in  (2.3).  We  now  define  a 
sequential  subset  selection  procedure  P(H,6*)  as  follows: 

Let  So  =  { 7r i , . . .  ,7Tfc}.  For  each  n  >  1,  define 

k  -  1 

Sn  =  {tt*  €  Sn_i|LJt(n,0)  <  -j—-— ■  for  all  n j  6  5n_i  -  {7r,}}.  (3.2) 

That  is,  Sn  is  the  set  of  contending  populations  up  to  stage  n.  At  stage  n,  population 

n i  €  Sn  is  labelled  as  good  if  Lij(n,6*)  >  for  all  nj  €.  Sn  —  {tt,}.  Let  |5n|  denote 

the  size  of  the  set  Sn.  If  either  |Sn|  =  1  or  all  the  populations  in  Sn  have  been  labelled  as 

good  then  the  procedure  terminates  and  we  take  S  =  Sn  as  the  selected  subset;  otherwise, 
we  go  to  the  next  stage.  The  procedure  is  continued  in  this  way. 

Termination  with  Probability  One 

In  order  to  apply  the  sequential  selection  procedure  P(H,6*),  we  need  to  assert  that 
this  procedure  terminates  with  probability  one.  As  described  above,  the  distribution  func¬ 
tion  H{y)  is  chosen  to  satisfy  Condition  A.  It  is  also  assumed  that  Condition  B  holds  true, 
where 

Condition  B:  ^55(2,6)  <  M[b )  a.e.  (Z,y(n))  for  each  b  >  0,  where  M(-)  is  bounded  on  the 
interval  [a,  00)  for  each  a  >  0.  That  is,  there  is  a  finite  function  g(-)  such  that  M(b )  <  q[a ) 
for  all  6  >  a  >  0. 

It  is  not  hard  to  verify  that  many  exponential  family  distributions,  including  normal, 
exponential,  binomial,  satisfy  Condition  B. 

To  prove  the  procedure  P(H,  6*)  terminating  with  probability  one,  it  suffices  to  show 
that  for  any  two  populations,  say  and  772,  with  probability  one  the  event  E  occurs, 
where  E  is  the  event  that  either  one  of  them  will  be  eliminated  (in  comparison  with  the 


other)  or  both  of  them  are  labelled  as  good.  Without  loss  of  generality,  it  is  assumed  that 
01  >  02.  For  simplicity,  in  the  following  of  this  section,  we  let  k  =  2  and  6  =  ( 0i,02 )• 


Defining  stopping  times  T\  and  T2  as  follows: 

j  i  =  max{rtJ ,  t ,  j  =  l,2,i  ^  j}  (3*3) 

where 

Tij  =  min  jn|L(y  (»,£*)  >  ^—^7 1  ,  (3.4) 

and 

T2  =  min  |njL12(n,0)  >  ^  |  .  (3.5) 

That  is,  Ti  is  the  random  time  at  which  both  7rj  and  ic2  have  been  labelled  as  good 
(assuming  no  elimination)  and  T2  is  the  random  time  at  which  n2  is  eliminated  by  Tr\ 
(assuming  that  n2  cannot  eliminate  -K\  and  there  is  no  labelling  for  good).  Note  that 
{Ti  <  00  or  T2  <  00}  C  E.  Thus,  it  suffices  to  prove  that  Pq{T\  <  00  or  T2  <  00}  =  1  for 
all  possible  configurations  of  the  parameter  0. 

We  have  the  following  two  lemmas.  The  proofs  are  given  in  the  Appendix. 

Lemma  3.1.  Under  Conditions  A  and  B,  Pq{T2  <  00}  =  1  if  6\  >  02. 

Lemma  3.2.  Under  Condition  A,  Pq{Ti  <  c»}  =  1  if  6\  =  02. 

Thus,  one  can  see  under  Conditions  A  and  B  that  Pq{Ti  <  00  or  T2  <  00}  =  1 
for  all  possible  configurations  of  the  parameter  0.  Hence,  under  Conditions  A  and  B,  the 
sequential  subset  selection  procedure  P(H,6*)  terminates  with  probability  one. 

4.  Probability  of  A  Correct  Selection 

For  each  n  >  1,  let  /,y(n)  denote  the  cr-field  generated  by  the  random  variables 
(W,y(m),  Z,;(m),m  =  l,2,...,n).  Then,  we  have  the  following  lemma. 

Lemma  4.1.  EQ{LXJ(n,6]t)\7lJ(n  -  1)]  =  Li}{n  -  for  all  n  >  1  and  9  e  Cl. 


Proof:  Note  that 


Eff\Eij{n,6ji)\?ij(n  1)] 


=  Eg  J  exp  j  (y  +  6ji)  J3  W'y(m)  -  J3  I*(2<y(m),y)  ~  Mzii(mh6a)\  |  dH(y)\ ZAn  ~  x) 

.  ®  v  m=l  m=l  ) 

r 6m  (  n-1  n-l  'j 

=  /  exp  •{  (y  +  Sji)  wii(m)  ~  J3  [*(3ij(m)»y)  “  \  Aij(9,y,n)dH(y), 

J°  l  m= 1  m=l  J 


where  A,y(0,y,n)  =  ^[exp{(y  +  6y,)W,y(n)  -  ^(Z,;(n),y)  +  ^(Zjy(n),5y,)}]  and  the 
second  equality  is  obtained  from  Fubini’s  theorem  and  by  the  independence  between 
(Wij(n),Zij(n))  and  J,y (n  -  l).  Now, 


Aij{9,y,n)  =  Ee  [£5o.[exp{(y  +  $yt)Wyy(n)  -  4(2tj(n),y)  +  ^(Ziy(n),<5yt)}|Zly(n)]]  =  1 


and  therefore,  the  result  follows. 


Theorem  4.1.  Let  {Sn}  be  the  sequence  of  the  sets  of  contending  populations  determined 
by  the  sequential  subset  selection  procedure  P(H,6*)  through  (3.2).  Then,  for  all  0  6  fl. 


k  -  1 


p0{£»(k)(n^(fc),i)  <  i  l~p*  for  a11  7r‘  €  “  {*&)}  for  a11  n  ^  ^ 


provided  that  Conditions  A  and  B  hold. 


Proof:  Note  that  for  any  0  €  fl. 


k  —  1 


P0{Li{k)(n,6{kU)  <  _  for  all  Tiy  €  5n-i  -  {*(*)}  for  all  n  >  1} 


A:  —  1 


=  1  -  Pg{Lnk)(n,6(k)ti)  >  — —  for  some  tt,  €  S„_i  -  {7r(fc)}  for  some  n  >  1} 


>  1  -  53  ^{^.(fc)(n^(fc),.)  >  ^  p,  for  some  n  >  1} 


By  Lemma  4.1,  {Z,,(fc)(n,  6^)^),  J,(k)(n);n  >  1}  forms  a  nonnegative  martingale. 
Thus,  from  a  lemma  of  Robbins  and  Siegmund  (1973),  for  i  *(*). 


k  _  i  i  -  p* 

*W^(fc)(M(fc).*)  >  t  _P,  for  some  71  >  !}  < 


>*>  rt  iTwrl 


W  «  -  *  _  >4  .  1 


»W 


nxmwxv* 


umwwwwwwwwwwwilff  vwwimmj'.k'  •Awv'M'Jv-jsnntxnm 


Then  by  (4.1)  and  (4.2),  the  result  follows. 

For  each  pair  (t ,j)  and  a  >  0,  let  Bty(n,a)  denote  the  event  that  Lt-y(n,a)  <  . 

That  is  Bij(n,a)  =  {Lij(n,a)  <  }.  The  following  result  is  very  helpful  for  obtaining 

some  sequential  estimate  of  an  upper  bound  on  £(*),»,*  =  1,2, .. .  ,k. 

Lemma  4.2.  Let  a  and  b  be  any  two  nonnegative  values  such  that  b  >  a  >  0.  Then, 

Bij(n,b)  C  Bij(n,a)  for  all  n  >  1.  (4.3) 


Proof:  Suppose  that  the  statement  of  (4.3)  is  not  true.  Then,  there  exists  some  n  >  1 
such  that 

k  -  l 

Lij{n,b)  <  1'^-p7  <  Lij{n,a).  (4.4) 

By  (3.1),  we  have 

n  n 

L{j(n,  6)  =  L,y(n,a)  exp{(6  -  a)  Wiy(m)  +  [^(Zt>(m),6)  -  ®(Zy(m),o)]}.  (4.5) 

m—  1  m=l 

(4.4)  and  (4.5)  together  imply  that. 


^2  Wi^m)  <  — ]T  [^(Zty(m),6)  -  tf(ZtJ(m), a)]. 


m=l 

Then,  by  (4.4)  and  (4.6), 
k  -  1 


b  —  a 


(4.6) 


m=  1 


i  < 


l  -  p* 

<  Lij(n,  a) 
s 


=  /  exp  j  (y  +  a)  ^  W'iy(in)  +  [*(Z,y(m),  a)  -  ^(Zt;(m),y)]  >  dtf(y)  (4.7) 

®  v  m=l  m=  1  J 

<  J  exp  |  —  (y  +  a)  ^  Qij(m,y,a,  b)  j  dH (y), 


where 


^  ^(ZtJ(m),y)  -  ^(Z,y(m),a)  ^(Z0(m), -a)  -  ^(Z,;(m), -6) 

Vo(m,y,a,6)  =  - 7 - r - 7 - ; - r~ 7\ - 

y-  -a)  -a  -  -6 
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By  Lemma  2.1  and  the  fact  that  —b  <  —a  <  y  for  each  y  E  [0,6*],  we  have,  for  each 
»?i  =  1,2, ...  ,n,  Qij{m,y,a,  b)  >  0.  With  this  fact  and  from  (4.7),  we  obtain  1  <  fj-p-.  < 
Lij(n,a)  <  1,  which  is  a  contradiction.  Therefore,  Bt;(n,6)  c  B,-y(n,a)  for  all  n  >  1  when 
b  >  a  >  0. 


An  immediate  result  of  Lemma  4.2  is: 

n  n 

n  B<AmM c  n  Bij(m,a)  for  ail  n  >  1  whenever  b  >  a  >  0. 


For  each  n  >  l,7r,, 7T,  E  Sn-i,i  ^  j,  define  Cxj(n)  and  D,;(n)  as  follows: 


k  _  1  ) 

a  >  0|LtJ(n,a)  <  x  _  p<  |  • 

(4.9) 

r  sup  Cij(n)  if  Cij(n )  ^  <f>, 

L0  if  C,j(n)  =  <f>, 

(4.10) 

where  <f>  denotes  the  empty  set.  Also,  let  Da(n)  =  0.  From  Lemma  4.2,  if  Dij(n)  >  0, 
then,  Lij(n,a)  <  ySp-  for  all  a  E  [0,  Dij(n))  and  L,y(n, 6)  >  for  all  b  >  Di:(n). 

For  each  n  >  2,  if  7T*  E  S„_  i,  define 


Di{n)  =  min  (  max  Ay(m)). 

1  S  mS  n  —  1 


(4.11) 


If  7r,-  £  5n_j,  let  n,-  =  max{m|7r,-  G  5m_i}  and  £>,(n)  =  L>,(nt). 


By  the  definition  of  L\(n),  for  each  i  =  1, . . . ,  k,  {Z>,(n)}  is  a  nonincreasing  sequence 
and  bounded  below  by  zero.  The  value  Z?,(n)  will  be  used  at  stage  n  as  an  estimator  of 
an  upper  bound  of  6 (*),-.  We  note  that  the  technique  used  to  define  D{(n)  is  from  Hsu  and 
Edwards  (1983)  for  location  parameter  model. 


Theorem  4.2.  Let  Lv(n,a),  Sn  and  Z?,(n),n  >  1  be  those  defined  in  (3.1),  (3.2)  and  (4.1), 
respectively.  Then 


k  —  1 


{Lx(k )K <5(fc)«)  <  1  _  p,  for  all  7 rt-  e  i  -  {7r(fc)}  for  all  n  >  1} 
c{7r^)  E  S  and  6^)i  <  £>,-(n )  for  all  7rt  E  Sn-i  for  all  n  >  1}. 


Proof:  By  Lemma  4.2  and  the  fact  that  6^)i  >  0,  we  have 

k  —  1 

{Li (*)(n, $(*),,•)  <  —  p~  for  all  t r,-  6  -  {7r(Jk)}  for  all  n  >  1} 

k  -  1 

C {£,•(*)  (n,0)  <  -  —  —  and  £(*)<  <  A  (*)(»)  for  all  €  Sn-i  -  {7T(fc)}  for  all  n  >  1} 
c{7T(fe)  G  5  and  £(*),-  <  £>,(*.)(«)  for  all  i r,-  6  S„_i  -  {*■(*)}  for  all  n  >  1}  (4.12) 

C{?T(fc)  £  &  and  £(*)«  S  max  Dij(n)  for  all  tt*  G  5n_i  for  all  n  >  1} 

fy€Sn_  i 

={7r(Jfc)  €  5  and  £(*),•  <  A(n)  for  all  7rt-  €  Sn- 1  for  all  n  >  1}. 

An  immediage  consequence  of  Theorems  4.1  and  4.2  is  that  for  all  0  G  Cl,  under 
Conditions  A  and  B. 

Pff{^(k)  G  S  and  6(fc)t  <  Di(n )  for  all  n ,•  G  Sn_i  for  all  n  >  1}  >  P* .  (4-13) 

This  result  provides  a  sequential  comparison  inference,  with  confidence  level  at  least  P", 
as  follows:  simultaneously,  at  each  stage  n,  the  best  population  is  not  eliminated  and  the 
separation  from  the  unknown  best  population  to  each  contending  population,  say  *{,  up  to 
stage  n,  is  not  larger  than  the  value  Z?,  (n )  for  all  n  >  1.  Another  consequence  of  Theorems 
4.1  and  4.2  is  that  when  the  sequential  selection  procedure  P(H,6*)  terminates,  the  event 
of  a  correct  selection  CS(6‘)  is  guaranteed  with  probability  at  least  P*.  We  state  this 
result  as  a  theorem  as  follows: 

Theorem  4.3.  Let  P(H,6*)  be  the  sequential  subset  selection  procedure  defined  in  Section 
3.  Then,  when  the  selection  procedure  P(H,6*)  terminates,  that  Pq{C  S{6*)\P(H,6*)}  > 
P*  holds  for  all  0  G  Cl. 

Proof:  Note  that  when  the  sequential  selection  procedure  P(H,6*)  terminates,  then  either 
|5|  =  1  or  all  the  populations  in  S  must  have  been  labelled  as  good  at  some  stage.  Let 
N  be  the  stopping  time  of  the  sequential  selection  procedure  P(H,6*)  and  when  |S|  >  2, 
for  each  7r,  G  S,  let  N,  denote  the  first  time  at  which  7r,  was  labelled  as  good.  Hence, 
Lij{N»6  *)  >  ^or  G  $Ni  —  {^t}-  By  the  definition  of  Z?tJ(n)  and  Lemma  4.2, 
Dij(Ni)  <  6’  for  all  n}  G  S;v,  —  {7r,},  and  thus,  <  6 *  if  7T(fc)  G  5jvt  —  {7r<}.  Also, 


note  that  S  =  Sn  and  when  |£(  >  2,  jV,  <  N  for  all  7r»  G  S.  Now 

{*■(*)  £  S  and  <*(*),•  <  A(*)(»)  for  all  tt,-  G  Sn_i  -  {*-(*)}  for  all  n  >  1}  (4.14) 

c{7T(jt)  €  S  and  |5|  =  1}  U  {?T(fc)  G  S,\S\  >  2  and  £(*),■  <  A(fc)CWi)  for  all  tt,-  G  5  —  {7T(fc)}} 
(since  S  C  Sn_i  for  all  n  and  Ni  <  IV  for  all  7r,-  G  5) 
c{7r(jt)  G  S  and  |S|  =  1}  U  €  5,  |5|  >  2  and  <  6*  for  all  tt,-  g  S  —  {^(fc)}} 
=CS{6*). 

Then,  by  (4.14),  (4.12)  and  Theorem  4.1,  we  have  that  for  all  0  £  fi, 

P0{CS(6*)\P(H,6*)}>P\ 

5.  A  Sequential  Procedure  for  Selecting  a  Good  Population 

Consider  a  selection  problem  among  k  populations.  In  most  applications,  an  exper¬ 
imenter  is  usually  content  with  selecting  a  good  population.  Kao  and  Lai  (1980)  has 
studied  the  problem  of  selecting  a  good  population  among  k  normal  populations  through 
sequential  approach.  For  our  problem,  with  this  selection  goal,  what  we  want  is  in  fact  a 
selection  procedure  having  the  property  that 

P0{^i  is  selected  and  6(k)i  <  £*}  >  P*  for  all  6  £  Cl.  (5.1) 

Also,  the  procedure  should  stop  as  soon  as  we  are  confident  that  a  good  population  has 

been  found.  A  sequential  selection  procedure  achieving  this  selection  goal  can  be  obtained 
from  P(H,S *)  with  a  little  modification  on  its  stopping  strategy.  We  now  describe  it  as 
follows. 

Let  {SVi}  be  the  sequence  of  sets  of  remaining  populations  defined  in  (3.2).  For  each 

7Tf  £  Sn,  7r,  is  labelled  as  good  if  Lij(n,6*)  >  for  all  i ry  £  Sn  —  {tt,}.  The  procedure 

terminates  at  stage  n  els  soon  els  one  of  the  populations  in  Sn  has  been  labelled  as  good. 
If  there  is  only  one  population  in  Sn  having  been  labelled  as  good,  we  then  select  this 
population  as  a  good  population.  If  more  than  one  population  have  been  labelled  as  good, 


we  usually  select  the  one  with  the  smallest  Di  (n)  value.  We  denote  this  modified  sequential 
selection  procedure  by  P' . 

The  following  theorem  guarantees  the  desired  confidence  level  of  selecting  a  good 
population  by  applying  the  selection  procedure  P'. 

Theorem  5.1.  Let  P'  be  the  sequential  selection  procedure  defined  above.  Then,  Pq{ki  is 
selected  and  <  <S*| P'}  >  P*  for  all  6  G  0,  provided  Conditions  A  and  B  hold. 

Note:  The  argument  for  the  proof  of  this  theorem  is  similar  to  that  of  Theorem  7  of  Kao 
and  Lai  (1980).  For  completeness,  the  proof  is  given  as  follows. 

Proof:  Let  A{6 )  =  {Lt(*)(n,  6k^))  <  for  all  7r,  G  Sn-i  -  {^(Jt)}  for  all  n  >}.  Then  by 
Theorem  4.1,  Pg{A(0)}  >  P*  for  all  6  G  0.  We  note  that  on  the  event  A(0),  by  Lemma  4.2, 

L, (fc)(n,0)  <  jzjh  for  all  n ,•  G  -  {^(fc)}  for  all  n  >  1.  Therefore,  can  never  be 

eliminated  in  comparison  with  other  populations  at  any  stage.  Let  B(6)  =  {7r,-| $(.<,■),■  >  5*}. 
That  is,  B{6)  is  the  set  of  bad  populations.  It  suffices  to  show  that  on  the  event  A(6),  any 
population  in  B(&)  cannot  be  labelled  as  good  when  the  procedure  P'  terminates. 

Let  M  be  the  stopping  time  of  the  sequential  selection  procedure  P' .  On  A[6),  since 
7T(fc)  can  never  be  eliminated  prior  to  the  stopping  time  M,  then  7T(jt)  G  Sn  for  all  1  <  n  < 

M.  Moreover,  for  each  7rt-  G  B(0),6(k)i  >  6*  >  0  and  on  A(6),Li(k)[n,f>(k)i)  <  '\Zp~  f°r 

all  1  <  n  <  M.  Then  by  Lemma  4.2,  L,(jt)(n,  <5‘)  <  which  means  that  7r,  cannot  be 

labelled  as  good.  Hence,  i r,-  cannot  be  selected  as  a  good  population.  Therefore,  for  all 
6  G  n,  we  have  P^{^i  is  selected  and  £(*),-  <  6*\P'}  >  P*. 

Appendix 

Proof  of  Lemma  3.1.  The  proof  is  analogous  to  that  of  Lemma  1  of  Poliak  and  Siegmund 
(1975).  Note  that  612  >  0  since  6 1  >  6 2.  For  convenience,  in  the  following,  we  let 
W (m)  =  W12(m),Z(m)  -  Zi2(m),  A(m,  a,  6)  =  V [Z i2(m) , a)  -  ^(^i2(m),6),<50  =  <5j2  and 
a  -  fjp.  •  Then, 

Li2(n,0)  =  f  exp  jy  ^  W(m)  -  A(m,y,0)|  dH(y). 


[JWUtl 


For  each  n,  define  T2n  —  nnn{T2,n}  —  1,  so  that  !T2fl  + 1  is  a  bounded  stopping  time. 

Case  1.  When  6n  <  cn  where  c0  is  given  in  Condition  A. 

From  the  definition  of  r2,  we  have 


log  a  >  logLi2(r2n,0) 

Tin  Tin 

=  So  £  W(m)  -  ^2  \(m,6o,0) 

m=l 

Tir 


m=l 

+  log 

Tin 

£ 
m=  1 

+ 


/„  “‘’I1 


(y  -  <5o)  £  W(m)  -  £  A("»>y,«5o)  >  dH[y)  (A.l) 

m= 1  m=l  J 

>S0J2  IWM  -  *«(Z(m),fio)]  -  £  [<50^(Z(m),^0)  -  A(m,<50,0)] 

i= '  m=l 

f  f  2a  „  'l 

log  /  ,  .  exp  1  ~6°)  £  -  £  AK^o)  l  dff(y). 


By  using  Taylor’s  expansion, 


A(m,y,60)  =  {y-60)V6{z,6o)  +  \{y  ~  ^o)2^^,  £) 


(A.2) 


where  £  =  £(*,y,60)  is  a  value  between  y  and  60.  By  Condition  B,  we  can  find  an  e  so 
small  that  \\ y  -  6o\V6s{Z{m),  £)  <  1  a.e.  for  all  y  £  («50  -  e,60  +  e).  We  then  obtain 

r  (  Tin  Tin  'j 

l0g  /  ,  ,  GXP  “  *>)  £  W(m)  -  £  Hrn,y,S0)  }  dH(y ) 

•/|y-50|<e  (  m=1  m=1  J 

~  IOg  I\y~60\<c  CXP  {(y  ”  5o)  S  l^H  ~  *,(W)I  ~  e  r2n|  dff(y)  (A3) 

Tin 

>  \ogH{60  ~e,60  +  e)  +  {d-  60)  £  [W{m)  -  «e(2(m),fi0)]  -  c  T2n, 

m=  1 

where  H(60  -  e,60  +  e)  =  Jly_Sul<edH(y),d  =  /|y_5o,  ydH(y)/H (60  -  e,S0  +  e),  and  the 
second  inequality  is  obtained  by  Jensen’s  inequality.  From  (2.6),  (A.l)  and  (A. 3),  we  have 


Tin 

£  \I(60\Z(m))  -  e]  +  d  £  [W(m)  -  ^(Z(m),<50)] 

m=l  m=l 

<  logo  -  log/f(50  -  £:,«50  +  e). 


(A.4) 


Therefore, 


Tan  +  l  Ti„  +  1 

E  |/(«o|Z(">))  -  *)  +  <*  E  [»'(m)-®l(Z(m),*)] 

m=l  m= 1 

<  log  a  -  log  H (So  -  £,  60  +  e)  +  I{60\Z[T2n  +  l))  -  e 
+  d[W{T2n  +  1)  -  tys{Z{T2n  +  l),«5o)]- 


Now  consider  the  expectation  Eg  of  both  sides  of  (A. 5).  Since 


Ee[\W{m)  -  *s{Z{m),60)}  =  Ee 


Z(m)} 


<\Ee 


then, 


£6o[|iy(m)-^(Z(m),60)|2|^(m)] 
=  {E0{*s(Z(m),6o)}}± 

<M*(60), 


Ee[W{m)  -  *s{Z(m),6o)}  =  Ee  [£fiolW(m)  -  *5(Z(m),£0)|Z(m)]i  =  0 


Also, 


0  <  Eg\I{60\Z(m)))  <  Ee\S0^s{Z[m),b0))  =  60Ee\W{m)}  <  cc. 


Thus,  by  Wald’s  lemma,  Eg 


T2n  +  1 


-  L  rn=l 


=  0,  and 


Ee 


Ty  4- 1 

E  |/(<g| Z(m))  -  e| 

,  m=l 


=  [£((r„  + 1)]  x  [%|/(*|7(i))|  - 1). 


Also,  by  Schwarz  inequality, 

Eg[W(T2n  +  1)  -  *s{Z{T2n  +  lMo)] 

<  Eg\\W(T2n  +  1)  -  *s(Z(T2n  +  1), 60)J] 

<  {Eg\{W{  1)  -  ^5(^(1), <50))2][^(T2n  +  1)]}- 
<Mi(«0)[^(T2n  +  l)]i 


% 

;8 

$ 

i 


V 


'A 

S' 

*2* 

| 

%!« 


Similarly, 


Therefore, 


and  so, 


Ed[I(60\Z(T2n  +  1)]  <  60Eg[<S/6(Z(T2n  +  l),tf0)] 

=  6QEe[W(T2n  +  1)] 

<  60{Ee_{W2(l)\Ee_[T2n  +  1]}*. 

{^[/(«ft|Z(l))]-e}x{^[ra«  +  l]} 

<{^o[^(^2(l))]^  +  dAf±(«50)}  X  {Eg  T2n  +  1}» 

-  log  If  (So  -  e,  So  +  c)  +  log  a, 

{£?|7(«o|Z(l))l  -  e}  x  {£?T,„  +  l}i 
<{tfo[^(W^2(l))]*  +  dMt(60)}  -  log  H (60  -  e,60  +  e)  +  log  a. 


M-6) 


Now,  since  7(60|Z(1))  is  positive,  e  can  be  chosen  so  small  that  Eq[I(6o)\Z(1)}  -  e  >  0. 
While  the  right-hand-side  of  (A.6)  is  independent  of  n,  so,  as  n  — ►  oo,  the  left-hand-side 
of  (A.6)  is  still  bounded.  Hence  Eq(T2)  <  oo  which  implies  that  Pq{T2  <  oo}  =  1. 

Case  2.  When  6n  >  cn.  From  (A.l),  we  also  have 

T?n  + 1  Tj„  +  1  Tan  +  l 

E  Hco\Z(m))  +  d  J2  [Wr(m)-^(^(m),«o)]+  £  {d{*s(Z(m),6o)  -  *(Z(m),c0) ] 

m  =  l  m=l  m=l 

<  log  a  —  \ogH(co  —  e,co  +  e)  4-  d[W(72n  +  l)  —  ^ e(Z(T2n  +  l),$o)] 

+  d[*s(Z(Tin  +  l),So)  -  *(Z(T2n  +  1), Co)]. 

Following  an  argument  analogous  to  the  above,  we  have  Eq[T2\  <  oo  and  so 
P${T2  <  oo}  =  1. 

Proof  of  Lemma  3.2.  Note  that  J: 2  =  0  since  0\  =  62.  Now, 

Ll2(n,6')  =  Jo  exp  j (y  +  61  E  +  E  j  dH{y)- 

By  Jensen’s  inequality,  we  have 


1 


n  1  log  L12(n,  <5 *)  >  n  1  E  Vm, 
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where  Vm  =  JQ  {(y  +  6*)W(m)  +  X(m,S*  ,y)}dH(y),  m  —  1,2, . . .  ,n,  are  iid.  By  Lemma 
2.1  and  the  fact  that  612  =  0,  then, 

f6' 

E6[Vm\  =  /  E0[*(Z(m),6 *)  -  ¥(Z(m),y)]dtf  (y) 

Jo 

>  f°  Ee[*(Z(m),6 *)  -  *(Z(m),y)]dtf(y)  (A.8) 

Jo 

>  r  Ee[*{Z{m),6*)  -  *( Z(m),c0)}dH(y ) 

Jo 

>  0. 

By  strong  law  of  large  numbers  and  by  (A. 7)  and  (A. 8),  we  obtain  lim  inf  n_1  log  Li2(n,  5*) 

n  — *  00 

>  /Qc°  E0\^>(Z{m),6*)  —  '&(Z[m),co)]dH(y)  a.e.  while  n~l  log  a  —*  0  as  n  — ►  00.  Therefore, 
Pq{T\2  <  00}  =  1  when  6\  —  02.  Similarly  Pq{T2  1  <  00}  =  1  and  thus  Pq{T\  <  00}  =  1 
when  6\  =  02- 
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